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Abstract A population protocol stably elects a leader 
if, for all n, starting from an initial configuration 
with n agents each in an identical state, with prob¬ 
ability 1 it reaches a configuration y that is correct 
(exactly one agent is in a special leader state £) and 
stable (every configuration reachable from y also has 
a single agent in state £). We show that any popu¬ 
lation protocol that stably elects a leader requires 
n{n) expected “parallel time”—17 (n^) expected to¬ 
tal pairwise interactions—to reach such a stable con¬ 
figuration. Our result also informs the understanding 
of the time complexity of chemical self-organization 
by showing an essential difficulty in generating exact 
quantities of molecular species quickly. 
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1 Introduction 

Background. Population protocols (PPs) were intro¬ 
duced by Angluin, Aspnes, Diamadi, Fischer, and 
Peralta [4] as a model of distributed computing in 
which the agents have very little computational power 
and no control over their schedule of interaction with 
other agents. They also can be thought of as a spe¬ 
cial case of Petri nets and vector addition systems 
[21, 22], which were introduced in the 1960s as a 
model of concurrent processing. In addition to be¬ 
ing an appropriate model for electronic computing 
scenarios such as mobile sensor networks, they are a 
useful abstraction of “fast-mixing” physical systems 
such as animal populations [24], chemical reaction 
networks, and gene regulatory networks [12]. 

A PP is defined by a finite set A of states that 
each agent may have,^ together with a transition 
functions : Ax A ^ AxA."^ Given states ri, r 2 ,pi,P 2 G 
d, if S{ri,r 2 ) = (pi,P 2 ) (denoted ri,r 2 -)> Pi,P 2 ) 

1 Some recent work on PPs [1-3,10,11,20] allows the 
number of states to grow with the number of agents. This 
paper uses the original model [4] with state set that is 
constant with respect to the population size. (See section 
“Related work”.) 

^ Some work allows “non-deterministic” transitions, in 
which the transition function maps to subsets of Ax A. Our 
results are independent of whether the transition function 
is deterministic or nondeterministic in this manner. 
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and a pair of agents in respective states ri and r 2 
interact, then their states become pi and p 2 -^ A con¬ 
figuration of a PP is a vector c G describing, for 
each state s G A, the count c(s) of how many agents 
are in state s. Executing a transition ri,r 2 — )> Pi,P 2 
alters the configuration by decrementing the counts 
of ri and r 2 by 1 each and incrementing pi and p 2 
by 1 each. Possibly some of ri,r 2 ,Pi,P 2 are equal to 
each other, so the count of a state could change by 
0, 1, or 2. 

Associated with a PP is a set of valid initial 
configurations that we expect the PP to be able 
to handle.^ Agents interact in a pairwise manner 
and change state based on the transition function. 
The next pair of agents to interact is chosen uni¬ 
formly at random among the n agents. If no tran¬ 
sition rule applies, the interaction is a “null tran¬ 
sition” ri, r 2 —^ ri, r 2 , in which the agents interact 
but don’t change state. We count the expected num¬ 
ber of interactions until some event occurs, and then 
define the “parallel time” until this event as the ex¬ 
pected number of interactions divided by the num¬ 
ber of agents n. This measure of time is based on the 
natural parallel model where each agent participates 
in a constant number of interactions in one unit of 
time, hence 0{n) total interactions are expected per 
unit time [6]. In this paper all references to “time” 
refer to parallel time. 

In order to define error-free computation in PPs, 
we rely on to the model of stable computation [8]. 
The model defines computation to be complete when 
the PP gets to a configuration that is correct® and 
“stable” in the sense that no subsequent sequence of 
transitions can take the PP to an incorrect config¬ 
uration. The model of stable computation disallows 
error even in an “adversarial” schedule of transitions: 
we require that from every configuration reachable 
by any sequence of transitions from the initial con¬ 
figuration, it is possible to reach to a correct sta¬ 
ble configuration. Since the configuration space is 

® In the most generic model, there is no restriction on 
which agents are permitted to interact. If one prefers to 
think of the agents as existing on nodes of a graph, then 
it is the complete graph Kn for a population of n agents. 

^ The set of valid initial configurations for a “self- 
stabilizing” PP is N^, where leader election is provably 
impossible [9]. We don’t require the PP to work if started 
in any possible configuration, but rather allow potentially 
“helpful” initial configurations as long as they don’t al¬ 
ready have small count states (see “o-dense” below). 

® What “correct” means depends on the task. For com¬ 
puting a predicate, for example, A is partitioned into “yes” 
and “no” voters, and a “correct” configuration is one in 
which every state present has the correct vote. 


finite, stable computation is equivalent to requiring, 
under the randomized model, that a correct stable 
configuration is reached with probability 1. Thus, al¬ 
though it may appear at first glance that correctness 
and expected time are defined with respect to differ¬ 
ent models of transition sequences—adversarial vs 
random—due to this equivalence they are both seen 
to be definable in the randomized model. This notion 
of stable computation is also equivalent to requiring 
that every fair sequence of transitions reaches a cor¬ 
rect stable configuration, where “fair” means that 
every configuration infinitely often reachable is in¬ 
finitely often reached [8]. For the arguments of this 
paper, the most convenient definition of stable com¬ 
putation is the first one, combinatorial in terms of 
reachability. 

A PP works “with a leader” if there is a special 
“leader” state £, and every valid initial configuration 
i satisfies i(f') = I. This is in contrast to a uniform 
initial configuration (i(a:) = n for some state x and 
i{y) = 0 for all states y x) oi an initial configu¬ 
ration only encoding the input (i(a::i) = Ui for i G 
{I, ... ,k} to represent any input (ni, 712 ,..., Uk) G 
N^). It is known that the predicates </> : N*’ —)• {0,1} 
stably computable by PPs are exactly the semilin- 
ear predicates, whether an initial leader is allowed 
or not [8]. Although the initial leader does not al¬ 
ter the class of computable predicates, it may allow 
faster computation. For example, the fastest known 
PPs to stably compute semilinear predicates with¬ 
out a leader take as long as 0(n) to converge.® In 
contrast, with a leader, it is known that any semilin¬ 
ear predicate can be stably computed with expected 
convergence time 0(log®n) [6]. Thus, in certain cir¬ 
cumstances, the presence of a initial leader seems to 
give PPs more computational power (e.g., to con¬ 
verge quickly). Angluin, Aspnes, and Eisenstat [6] 
asked whether polylogarithmic time stable compu¬ 
tation of semilinear predicates is possible without a 
leader; absent a positive answer, the presence of a 
leader appears to add power to the model. 

Statement of main result. Motivated in part by the 
apparent speedup possible with an initial leader, we 
ask how quickly a leader may be elected from a con¬ 
figuration lacking one. We pose the problem as fol¬ 
lows: design a PP V with two special states x (the 
initial state) and £ (the leader state, which may or 
may not be identical to x) such that, for every n G N, 

® See “Open questions” for the distinction between time 
to converge and time to stabilize. In this paper, the time 
lower bound we prove is on stabilization. 
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from the initial configuration defined as i„(a:) = n 
and in(y) = 0 for all other states y, has the following 
property. For every configuration c reachable from 
in,, there is a configuration y reachable from c that 
has a stable leader, meaning that in all configurations 
y' reachable from y (including y itself), y'(^) = 1. 

There is a simple 0{n) expected time PP for sta¬ 
ble leader election, with (assuming x = tj the sin¬ 
gle transition £,£ —>■ £,f. Our main theorem shows 
that every PP that stably elects a leader requires 
time J7(n) to reach a conhguration with a stable 
leader; thus the previous PP is asymptotically opti¬ 
mal. Section 3.3 discusses why some straightforward 
approaches to proving a time lower bound for leader 
election fail. 

Composing leader election with other tasks. We have 
motivated the importance of leader election in pop¬ 
ulation protocols, in part, by reference to tasks that 
seem to require a leader to be present initially. How¬ 
ever, a fast leader election protocol would only al¬ 
leviate the need for an initial leader if the election 
could be composed with the subsequent task. Sim¬ 
ply combining all transitions for the leader election 
with the transitions for the task does not necessarily 
work: prior to the conclusion of the leader election, 
the presence of multiple leaders may result in unin¬ 
tended transitions. 

A number of ad-hoc schemes successfully com¬ 
pose leader election with computation that requires 
a leader. For example, in ref. [4], the stable com¬ 
putation of the “remainder protocol” depends on 
leader election to stabilize on a single leader which 
collects the remainder information. As leaders drop 
out they transfer their information to the surviving 
leaders. However, to facilitate composition of leader 
election with a subsequent task, it would be easiest 
to have a leader election protocol with a stronger ter¬ 
mination criterion than the “stabilizing” criterion we 
study, namely the “terminating” criterion in which 
the leader “knows” when it has been elected, and 
only then would it trigger the subsequent task to 
begin. One way to formalize “knowing when it has 
been elected” is to require that the system never has 
more than one leader.^ However, it is simple to prove 

^ If it were possible to detect when a leader election pro¬ 
tocol such a,s 1,1 ^ i, f has stabilized, in the sense that 
each agent carries a bit {g, s} in which all agents start with 
g and only transition to s after a single £ exists, then one 
could consider the product state {£, s) to be the “true” 
leader state, and {£, g) is considered only a “candidate” 
leader state that may have count > 1 prior to the stabi¬ 
lization to a single £. 


that leader election with this terminating conven¬ 
tion is impossible: Dividing the initial population in 
half and preventing the two halves from interacting, 
each half (being a valid initial configuration itself) 
must elect a separate leader, violating the require¬ 
ment that its count never exceeds 1. 

Thus the question of how to systematically com¬ 
pose leader election with arbitrary downstream com¬ 
putation is itself open. Nevertheless, we show that 
even if the protocol follows the more liberal stabiliz¬ 
ing criterion, in which the leader need not “know” 
when it has been elected, stabilization to a single 
leader still requires linear time. 

Multiple leader states, multiple leaders, and other 
initial configurations. A more general notion of leader 
election is to identify a subset iF C A of states that 
are all considered leader states, and to require the 
PP to eventually reach a configuration y in which 
^^g,^y(£) = 1, and this sum is 1 in every configu¬ 
ration reachable from y. This corresponds more ap¬ 
propriately to how leader states actually coordinate 
computation in PPs: a leader agent must remember 
some state information in between transitions (hence 
it changes state while remaining the unique leader). 
Our techniques actually show this stronger result as 
well (as explained in Section 3.2). Further, our re¬ 
sult implies that a PP cannot elect any fixed quan¬ 
tity of leaders (e.g. exactly 256) or variable quantity 
of leaders under a fixed bound (e.g. at most 256) in 
sublinear expected time. 

In the simplest formulation of the task of leader 
election, we always start with n agents in state x 
(as described above). Can we capture more gener¬ 
ally leader election from a configuration “without a 
pre-existing leader” ? Intuitively, we want to exclude 
initial configurations with states present in small but 
non-zero count. We can exclude such initial config¬ 
urations, but allow otherwise deliberately prepared 
starting conditions, using the notion of a-dense con¬ 
figurations: any state present in the initial config¬ 
uration has count > an. Our general negative re¬ 
sult (Theorem 3.8) implies that even starting with 
best-case initial configurations, as long as, for some 
constant a > 0, they are all a-dense, sublinear time 
leader election is impossible. An open question re¬ 
lates to weakening the notion of a-dense (see below). 

Chemical reaction networks. The main result and 
proof are stated in the language of PPs; however, the 
result holds for more general systems that have PPs 
as a special case. The discrete, stochastic chemical 
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reaction network (CRN) model has been extensively 
used in the natural sciences to model chemical kinet¬ 
ics in a well-mixed solution [19]. The CRN model is 
also used prescriptively for specifying the behavior of 
synthetic chemical systems [14,23]. A CRN is a finite 
set of species (corresponding to PP states) such as 
A, Y, Z, and reactions (corresponding to PP transi¬ 
tions) such asX + Y^ZorY^ 2X -|- Z. CRNs can 
be thought of as a generalization of PPs in which 
spontaneous transitions are possible (unimolecular 
reactions), the transition may cause the number of 
agents to change (if the reaction has a different num¬ 
ber of products than reactants), and each transition 
has an associated constant k that affects its proba¬ 
bility of being selected. 

As an essential form of self-organization, biolog¬ 
ical cells seem able to precisely control the count 
of certain molecules (centriole number [15] is a well 
studied example). How chemical systems transform 
relatively uncontrolled initial conditions to precisely 
controlled amounts of desired species is still not well 
understood. Our negative result applied to CRNs® 
implies that generating with probability 1 an exact 
positive count of a certain species, whether 1 or 256, 
is necessarily slower (l7(n) time) than, for example, 
destroying all molecules of the species (through the 
reaction X 0), which takes O(logn) time. 

Open questions. Although we measure computation 
time with respect to stabilization—the ultimate goal 
of stable computation—some work uses a different 
goalpost for completion. Consider a PP stably elect¬ 
ing a leader, and one particular transition sequence 
that describes its history. We can say the transition 
sequence converged at the point when the count of 
the leader is the same in every subsequently reached 
configuration (if the PP is correct, this count should 
be 1). In contrast, recall that the point of stabiliza¬ 
tion is when the count of the leader is the same in 
every subsequently reachable configuration (whether 
actually reached in the transition sequence or not). 
Measuring time to stabilization in the randomized 
model, as we do here, measures the expected time 
until the probability of changing the output becomes 
0. To help illustrate the difference between these two 


® Our result holds for any CRN that obeys Theorem 4.3, 
the precise constraints of which are specified in [17] (those 
constraints automatically apply to all PPs). Importantly, 
to generalize to CRNs, we never assume that the count 
of agents is fixed, but rather use n to indicate the initial 
count. 


subtly different concepts. Section 3.3 shows some ex¬ 
amples of PPs that converge before stabilizing. 

Our proof shows only that stabilization must take 
expected f7(n) time. However, convergence could oc¬ 
cur much earlier in a transition sequence than stabi¬ 
lization. We leave as an open question whether there 
is a PP that stably elects a leader and converges in 
expected o(n) time. We reiterate that there are PPs 
that work with a leader to stably compute semilin- 
ear predicates with convergence time 0(log® n) [6]. 
Thus if stable leader election can converge in ex¬ 
pected sublinear time, by coupling the two PPs it 
might be possible to achieve stable computation of 
arbitrary semilinear predicates with sublinear con¬ 
vergence time. 

It should be noted that the optimal stabiliza¬ 
tion time for stably computing semilinear predicates, 
even with an initial leader, is still an open question. 
The stably computing PPs converging in 0(log® n) 
time [6] provably require expected time f2{n) to sta¬ 
bilize, and it is unknown whether faster stabilization 
is possible. 

Going beyond stable computation, the open ques¬ 
tion of Angluin, Aspnes, and Eisenstat [6] asks whether 
their efficient high-probability simulation of a space- 
bounded Turing machine by a PP could remove the 
assumption of an initial leader. That simulation has 
some small probability e > 0 of failure, so if one 
could elect a leader with a small probability e' > 0 
of error and subsequently use it to drive the simu¬ 
lation, by the union bound the total probability of 
error would be at most e -I- e' (i.e., still close to 0). 
However, it remains an open question whether the 
necessary PP exists. 

Our general negative result applies to a-dense 
initial configurations. However, is sublinear time sta¬ 
ble leader election possible from other kinds of ini¬ 
tial configurations that satisfy our intuition of not 
having preexisting leaders? It is known, for exam¬ 
ple, that for each 0 < e < 1, an initial configuration 
with 0(n) agents in one state and 0{n^) in another 
state can elect a leader in expected time 0(log^ n) 
with high probability [6] , although this protocol has 
a positive probability of failure. In Section 3.3 we 
give an example PP that stably elects a leader in 
0 (n^/^ log n) time starting from an initial configu¬ 
ration with 0{n) agents in one state and 0{n}/'^) in 
another state. In general we want to better charac¬ 
terize the initial configurations for which sublinear 
time leader election is possible. 
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Related work. Alistarh and Gelashvili [2] showed that 
relaxing the requirement of 0(1) states to O(log^n) 
states allows for a leader to be stably elected in ex¬ 
pected time O(log^n). (Indeed, our proof technique 
fails if the number of states is not constant with re¬ 
spect to n.) Alistarh, Aspnes, Eisenstat, Gelashvili, 
and Rivest [1] have further refined our understand¬ 
ing of PPs with non-constant states by showing: (1) 
a time complexity lower bound: even with up to 
O(loglogn) states, any stable leader election pro¬ 
tocol still requires “near linear” (f2(n/polylog n)) 
expected time to stabilize, and (2) a time complex¬ 
ity upper bound: reducing the O(log'^n) state re¬ 
quirement of [2] to 0(log^ n), at the cost of requiring 
0(log®n) expected time to stabilize. They are fur¬ 
thermore able to apply these techniques to show sim¬ 
ilar lower and upper bounds for the majority prob¬ 
lem [7], where the initial majority among a popula¬ 
tion of states X and y should eventually occupy the 
whole population. 

Whether a constant bound on the number of states 
is appropriate depends upon the situation being mod¬ 
eled by the PP. In some settings—e.g., sensor networks- 
it is reasonable that employing larger “swarms” may 
be helped by slightly increasing the memory per sen¬ 
sor (say, logarithmically with n). However, when mod¬ 
eling biological regulatory networks, for example, each 
state corresponds to an existing chemical species, 
and 0(1) states is natural. 


2 Preliminaries 

If A is a finite set (in this paper, of states), we 
write to denote the set of functions c : A —>■ N. 
Equivalently, we view an element c £ as a vec¬ 
tor of |A| nonnegative integers, with each coordi¬ 
nate “labeled” by an element of A. Given s £ A 
and c £ we refer to c(s) as the eount of s in 
c. Let ||c|| = ||c||i = EsGyic(s) denote the total 
number of agents. We write c < c' to denote that 
c(s) < c'(s) for all s £ A. Since we view vectors 
c £ equivalently as multisets of elements from 
A, if c < c' we say c is a subset of c'. It is some¬ 
times convenient to use multiset notation to denote 
vectors, e.g., {x,x,y} and {2x,y} both denote the 
vector c defined by c(a:) = 2, c(y) = 1, and c{z) = 0 
for all z ^ {x,y}. Given c,c' £ N^, we define the 
vector component-wise operations of addition c-f c', 
subtraction c — c', and scalar multiplication me for 
TO € N. For a set Z\ c A, we view a vector c £ 


equivalently as a vector c £ by assuming c(s) = 0 
for all s € A \ Z\. 

The following lemma is used frequently in rea¬ 
soning about population protocols. 

Lemma 2.1 (Dickson’s Lemma [16]) Any infi¬ 
nite sequence xo,xi,... £ has an infinite non¬ 
decreasing subsequence Xig < x^^ < ..., where io < 
ii < ... e N. 


2.1 Population Protocols 

A population protocol (PP) is a pair V = (A, <5),® 
where A is a finite set of states, and i5:AxA—^-AxA 
is the (symmetric) transition function. A configura¬ 
tion of a PP is a vector c £ N^, with the interpre¬ 
tation that c(s) agents are in state s. By conven¬ 
tion, the value n £ 'lA represents the total num¬ 
ber of agents ||c||. A transition is a 4-tuple a = 
(ri,r 2 ,pi,p 2 ) £ A^, written a : ri,r 2 -P- pi,p 2 , such 
that 6 {ri,r 2 ) = (pi,p 2 ). This paper typically defines 
a PP by a list of transitions, with 5 implicit. If an 
agent in state ri interacts with an agent in state r 2 , 
then they change states to pi and p 2 . For every pair 
of states ri, r 2 without an explicitly listed transition 
ri,r 2 —>■ pi,p 2 , there is an implicit null transition 
ri, r 2 ri, r 2 in which the agents interact but do 
not change state. 

More formally, given a configuration c and tran¬ 
sition a : ri,r 2 pi,p 2 , we say that a is applicable 
to c if c > {ri,r 2 }, i.e., c contains 2 agents, one 
in state ri and one in state r 2 . If a is applicable 
to c, then write a{c) to denote the configuration 
c—{ri, r2}-I-{pi,p2} (i-e., the configuration that re¬ 
sults from applying a to c); otherwise a(c) is unde¬ 
fined. A finite or infinite sequence of transitions (a^) 
is a transition sequence (or path). Applying a finite 
or infinite transition sequence (a^) starting at config¬ 
uration Co induces a finite or infinite sequence of con¬ 
figurations (co,Ci,...) such that, for all c^ {i > 1), 

® We give a slightly different formalism than that of [8] 
for population protocols. The main difference is that since 
we are not deciding a predicate, there is no notion of in¬ 
puts being mapped to states or states being mapped to 
outputs. Another difference is that we assume the tran¬ 
sition function is symmetric (so there is no notion of a 
“sender” and “receiver” agent as in [8] ; the unordered pair 
of states completely determines the next pair of states). 
However, the results of this paper hold even if we allow 
the transition function to be non-symmetric or even to be 
non-deterministic (allowing transitions such as a,b ^ c,d 
and a,b ^ x,y to coexist). 
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Ci = If a finite transition sequence q, 

when applied to the starting configuration c, ends 
with c', we write c =^q c' . We write c => c' if such 
a transition sequence exists (i.e., it is possible for the 
system to reach from c to c') and we say that c' is 
reachable from c. If it is understood from context 
what is the initial configuration i, then say c is sim¬ 
ply reachable if i c. Note that this notation omits 
mention of V; we always deal with a single PP at a 
time, so it is clear from context which PP is defining 
the transitions. If a transition a : ri, r 2 pi^p 2 has 
the property that for i G {1,2}, ^ {pi,P 2 }, or if 

(ri = r 2 and {ri ^ pi or ri ^ P2)), then we say that 
a consumes ri. In other words, applying a reduces 
the count of Vi. We similarly say that a produces pi 
if it increases the count of pi. 

2.2 Time Complexity 

In any configuration the next interaction is chosen by 
selecting a pair of agents uniformly at random and 
applying transition function S. To measure time we 
count the expected total number of interactions (in¬ 
cluding null transitions such as a, & —>■ a, & in which 
the agents interact but do not change state), and 
divide by the number of agents n. (In the popula¬ 
tion protocols literature, this is often called “par¬ 
allel time”; i.e. n interactions among a population 
of n agents corresponds to one unit of time). Let 
c G and C C Denote the probability that the 
PP reaches from c to some configuration c' G C by 
Pr[c^>C']. If Pr[c=4>C'] = 1,^^ define the expected 
time to reach from c to C, denoted T[c => C], to be 
the expected number of interactions to reach from c 
to some c' G C, divided by the number of agents n. 

3 Main Results 

3.1 Impossibility of Sublinear Time Stable Leader 
Election 

We consider the following stable leader election prob¬ 
lem. Suppose that each PP V = {A, <5) we consider 
has a specially designated state £ G /I, which we call 

When the initial configuration to which a transition 
sequence is applied is clear from context, we may over¬ 
load terminology and refer to (co,ci,...) as a transition 
sequence or path. 

Since PP’s have a finite reachable configuration space, 
this is equivalent to requiring that for all x reachable from 
c, there is a c' € C reachable from x. 


the leader state. Informally, the goal of stable leader 
election is to be guaranteed to reach a configuration 
with count 1 of £ (a leader has been “elected”), from 
which no transition sequence can change the count 
of £ (the leader is “stable”). We also assume there is 
a special initial state x (it could be that x = i but it 
is not required), such that the only valid initial con¬ 
figurations i are of the form i(a;) > 0 and i{y) = 0 
for all states y G A\ {ccj. We write i„ to denote such 
an initial configuration with i„(a;) = n. 

Definition 3.1 A configuration y is stable if, for 
all y' such that y=^y', y'(£) = y(£) (in other 
words, after reaching y , the count of£ cannot change); 
y is said to have a stable leader if it is stable and 

y(^) = 1 . 

The following definition captures our notion of 
stable leader election. It requires the PP to be “guar¬ 
anteed” eventually to reach a configuration with a 
stable leader. 

Definition 3.2 We say a PP stably elects a leader 
if for all n G for all c such that in there 

exists y with a stable leader such that c=^y. 

In other words, letting Y denote the set of config¬ 
urations with a stable leader, every reachable config¬ 
uration can reach to Y. It is well-known [8] that Def¬ 
inition 3.2 is equivalent to requiring Pr[i„ = 1 

for all n G 

We note that our PP model captures anonymous 
nodes defined on a complete communication graph. 
Thus, agents in the same state are truly indistin¬ 
guishable. Consequently, our formalism does not dis¬ 
cern whether the agent that becomes the single leader 
stays the leader, or whether the leader state moves 
among agents (reminiscent of token passing). 

Definition 3.3 Let t : —5> R+, and let Y be the 

set of all configurations with a stable leader. We say 
a PP stably elects a leader in time t{n) if, for all 
n G T[i„ => Y] < t{n). 

Our main theorem says that stable leader elec¬ 
tion requires at least linear time to stabilize: 

Theorem 3.4 If a PP stably elects a leader in time 
t{n), then t{n) = f2{n). 

Thus a PP that elects a leader in sublinear time 
cannot do so stably, i.e., it must have a positive prob¬ 
ability of failure. 

The high-level strategy to prove Theorem 3.4 is 
as follows. With high probability the PP initially 
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goes from configuration i„ to configuration x„, such 
that in the sequence (x„) for increasing population 
size n, the count of each state grows without bound 
as n ^ oo (indeed the count of each state grows 
linearly with n); this follows from Theorem 4.3. We 
then show that any such configuration must have an 
“0(l)-bottleneck transition” before reaching a con¬ 
figuration with a stable leader (informally this means 
that every transition sequence from x„ to a configu¬ 
ration y with a stable leader must have a transition 
in which both input states have count 0(1), depend¬ 
ing on the PP but not on n). Since it takes expected 
time f2{n) to execute a transition when both states 
have constant count, from any such configuration it 
requires linear time to stably elect a leader. Since 
one of these configurations is reached from the ini¬ 
tial configuration with high probability, those con¬ 
figurations’ contribution to the overall expected time 
dominates, showing that the expected time to stably 
elect a leader is linear. 


3.2 More General Impossibility Result in Terms of 
Inapplicable Transitions and Dense Configurations 

Rather than proving Theorem 3.4 using the notion 
of leader stability directly, we prove a more general 
result concerning the notion of a set of inapplicable 
transitions. We generalize in two ways. (I) A config¬ 
uration y is stable by Definition 3.1 if no transition 
altering the count of £ is applicable in any config¬ 
uration reachable from y; Definition 3.5 generalizes 
this to an arbitrary subset Q of transitions. (2) The 
valid initial configurations of Section 3.1 are those 
with i„(a;) = n and in{y) = 0 for all y G A \ {a;}; 
Theorem 3.8 generalizes this to any set / of configu¬ 
rations that are all “a-dense”: any state can be ini¬ 
tially present as long as it is present in “large count” 
(at least a constant fraction of the population; see 
Definition 3.6). We also require the allowed initial 
configurations to satisfy a weak sort of “closure un¬ 
der doubling” property: namely, that there is an infi¬ 
nite subset /' C / such that 21' C I. In other words, 
there are infinitely many is/ such that 2i G I. This 
is true if / is closed under addition, for example if 
/ is the set of uniform initial configurations, or if 
/ is the set of all a-dense configurations for some 
a > 0 (since doubling a configuration preserves its 
density). 

Definition 3.5 Let Q he a set of transitions. A con¬ 
figuration y G is said to be Q-stable if no transi¬ 


tion in Q is applicable in any configuration reachable 
from y. 

Let / C hh’' and Q be a set of transitions. Let Y 
be the set of Q-stable configurations reachable from 
some configuration in I. We say that a PP V — (A, <5) 
Q-stabilizes from I if, for any i G /, Pr[i=4>y] = 
1.^^ If J and Q are understood from context, we say 
that V stabilizes. For a time bound fin), we say that 
P stabilizes in expected time fin) if, for all i G / such 
that ||i|| = n, T[i=4>F] < t{n). 

Definition 3.6 Let 0 < a < 1. We say that a con¬ 
figuration c is a-dense if for all s G A, c(s) > 0 
implies that c(s) > a||c||, i.e., all states present in 
c occupy at least an a fraction of the total count of 
agents. 

In order to reason about the behavior of PPs for 
larger and larger population sizes, we consider infi¬ 
nite sequences of configurations that “slice” across 
different population counts n. In other words, these 
sequences consist of configurations satisfying certain 
criteria that are reachable from ever-larger initial 
configurations. (Note that such configurations are 
not reachable from each other since they have dif¬ 
ferent numbers of agents.) When C spans infinitely 
many population sizes, the following definition ex¬ 
presses a basic distinction in how state counts scale 
with increasing population size: 

Definition 3.7 For an (infinite) set/seguence of con¬ 
figurations C, let bdd(C') he the set of states 

{ s G A I (36 G N)(Vc G C) c(s) < 6 }. 

Let unbdd(C') = A\bdd(C'). 

Remark 1 Note that if C = (c^) is a nondecreasing 
sequence, then for all fc G N, there is Cm such that 
for all s G unbdd(cm), Cm(s) > k. (Note that if C 
is not nondecreasing, the conclusion can fail; e.g., 
Cm(si) = m,Cm{s 2 ) = 0 for m even and Cm(si) = 
0 ,Cm(s 2 ) = TO for TO odd.) 

The following is our most general theorem, which 
the rest of the paper is devoted to proving. 

Theorem 3.8 Let V = (A, 6) be a PP, Q be any 
subset of transitions of P, a > 0, and L C be a 
set of a-dense initial configurations such that there 
is an infinite subset L' Q L such that 21' C /. Let 
Y be the set of all Q-stable configurations reachable 

Recall that the condition Pr[i=>y] = 1 is equivalent 
to [(Vc G N^) i=^c implies (3y £Y) c=>y]. 
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from I. Suppose V Q-stabilizes from I in expected 
time o{n). Then there are infinitely many v S K 
such that Vs G bdd(y), v(s) = 0. 

In other words, if some states have “small” count 
in all reachable stable configurations, then there is a 
reachable stable configuration in which those states 
have count 0. A PP V that stably elects a leader is 
a PP in which Q is the set of transitions that alter 
the count of £, / = { i„ | n G N } (note all i„ are 
1-dense and all of I is closed under doubling), Y 
is the set of configurations reachable from I with a 
stable leader, and V Q-stabilizes from I. Hence by 
Theorem 3.8, if V stabilizes in expected time o(n), 
there is a v that is both stable and reachable, where 
v(^) = 0, a contradiction. Thus Theorem 3.4 follows 
from Theorem 3.8. 

We can also use Theorem 3.8 to prove that stable 
leader election requires linear time under the more 
relaxed requirement that there is a set <T C A of 
“leader states,” and the goal of the PP is to reach 
a configuration y in which y(^) = 1 stays 

1 in any configuration reachable from y. Choosing 
Q as the set of transitions that alter that sum. The¬ 
orem 3.8 implies this form of stable leader election 
also requires Q{n) expected time. 

The rest of the paper is organized as follows. 
We conclude Section 3 with two example PPs show¬ 
ing that sublinear time leader election is possible if 
we relax the a-dense requirement in Theorem 3.8— 
showing that this requirement is indeed necessary. 
These examples are particularly useful in discarding 
certain simple proof techniques that naturally come 
to mind. Then Section 4 develops the technical tools, 
which we use in Section 5 to complete the proof of 
Theorem 3.8. Throughout the rest of this paper, fix 
V = (A,(5), a, /, and Q as in the statement of The¬ 
orem 3.8. 


3.3 Why Simple Proofs Fail 

It is tempting to believe that the main theorem fol¬ 
lows by a simple argument based on reasoning about 
the last transition to change the count of the leader. 
Indeed, if we start with more than one leader, and no 
transition rule can produce a new leader, then we can 
easily prove the impossibility of sublinear time leader 
election as follows. To quickly reduce from two lead¬ 
ers to one, the other agent’s state must be numerous 
in the population, so the same transition could occur 
again. This would leave us with no leaders and no 


possibility to make a new leader. However, if tran¬ 
sitions can produce new leaders, then the argument 
cannot reason only about the last transition involv¬ 
ing the leader. We illustrate this using two examples, 
which the authors have found helpful in ruling out 
plausible-sounding but ultimately insufficient ideas 
for proving a negative result. 

We describe two PPs that stably elect a leader 
in sublinear time starting from initial configurations 
that are not a-dense (for a > 0 independent of 
n). (Since the initial configurations are not a-dense 
these PPs do not contradict the statement of our 
main theorem.) In both examples, with high proba¬ 
bility exactly one transition involving the leader oc¬ 
curs. In the first example the transition produces 
precisely one leader in a configuration that previ¬ 
ously had none, whereas in the second example, it 
consumes precisely one leader in a configuration that 
previously had two. (Clearly, these are the only two 
possible forms of the final transition involving the 
leader.) The examples imply that any proof of the 
main result cannot be based solely on reasoning about 
the final transition, but must additionally establish 
that configurations such as the initial configurations 
of these PPs cannot be reached with high probability 
in sublinear time. 

Consider the following PP, with initial configu¬ 


ration i given by i(r) = i(a;) = n — and 

transitions: 

r,r ^ £,k (3.1) 

r,k ^ k,k (3-2) 

X, k ^ k, k (3.3) 

i,i^£,k (3.4) 


Transition (3.1) is the only one possible initially, and 
it takes expected time 6?(n^/^) to occur for the first 
time, producing a single leader. Transition (3.4) en¬ 
sures that if transition (3.1) occurs more than once, 
the PP will eventually stabilize to a single leader. 
However, with high probability transitions (3.2) and (3.3) 
consume all r and x before (3.1) executes a second 
time. After exactly one instance of transition (3.1) 
occurs, let a speed fault denote the event that transi¬ 
tion (3.1) occurs again (this is the same speed fault 
concept studied in ref. [13]). For convenience, for 
state s G A, let s also denote the count of that state 
in the configuration considered. 

Conditioned on the next interaction being non¬ 
null, i.e., it is either a speed fault (transition (3.1)) 
or moves closer to converting all x and r to k (transi¬ 
tion (3.2) or (3.3)), the probability of a speed fault in 
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any particular configuration is < 

k(n-k-i) ’ since r(r — 1) is the number of ways of 
choosing two agents in state r (leading to a speed 
fault), and 2fc(n — fc — 1) is the number of ways of 
choosing an agent in state k and another agent in 
state either r or x, when £ = I (increasing the count 
of fc), and therefore r + a: = n — fc — 1. By the union 
bound, the probability that a speed fault occurs in 
between fc = 1 and fc = n — 1 (at which point tran¬ 
sition (3.1) is disabled and the PP stabilizes) is at 
most 


P/2 


n—2 

E 


^ fc(n — fc — 1) 


= 




Whether or not a speed fault occurs, to pro¬ 
duce £, transition (3.1) must occur for the first time, 
taking expected time 0(n^/^). Let T be the ran¬ 
dom variable denoting the time to stabilization after 
transition (3.1) has occurred for the first time. If 
a speed fault occurs, then transition (3.4) must exe¬ 
cute enough times to reduce ^ to 1, which requires ex¬ 
pected time 0{n) [6]. Thus E[T|speed fault] = 0(n). 
Now to analyze E[T|no speed fault], note that if only 
transitions (3.2) and (3.3) existed, then after produc¬ 
ing a single fc, the expected time for transitions (3.2) 
and (3.3) to convert all x and r into fc would be 
O(logn) (this is known as an “epidemic” [4]). If we 
consider that transition (3.1) is also competing with 
transitions (3.2) and (3.3), and then we condition 
on transition (3.1) not occurring before all r are 
converted to fc, then this conditioning can only re¬ 
duce this expected time. Thus E[rjno speed fault] = 
O(logn). 

Thus, the total expected time to stabilize to a 
single leader is at most 

+ Pr[speed fault] ■ E[Tjspeed fault] 

-|-Pr[no speed fault] • E[Tjno speed fault] 

< 0(n^/^) -b O ■ 0{n) -b 1 • O(logn) 

= logn), 

i.e., sublinear time. 

The above PP uses a non-dense initial configura¬ 
tion since i(r) = o{n). Thus, although it does not di¬ 
rectly contradict the existence of a linear time lower 
bound from dense configurations, it points out that 
any proof based on reasoning about the last transi¬ 
tion to alter the count of £ must disallow the possi¬ 
bility that a leader is elected from some intermediate 


configuration in the manner described above. With 
high probability all states obtain count 17 (n) in a 
constant amount of time;^^ however, it is possible to 
subsequently reduce some states to sublinear count 
after super-constant time. A priori, it is conceivable 
that after, say, O(logn) time, the PP reaches a non- 
dense configuration, with £ = 0 and r Ri similar 
to i above, which would then elect a leader in sub¬ 
linear time by producing a single £ with high proba¬ 
bility. 

This example shows the difference between con¬ 
vergence and stabilization. Assuming no speed fault 
occurs, the PP converges when the first transition (3.1) 
occurs, but it does not stabilize until transition (3.2) 
reduces the count of r below 2, disabling transi¬ 
tion (3.1) from occurring again. 

We now consider another example PP. Even if 
the final change of £ takes it from 2 to 1, it is a pri¬ 
ori conceivable that the final transition r,£ Pi,P 2 
to consume £ has count o{n) of r, so that, although 
a second execution of the transition is possible, the 
second execution requires sufficiently long expected 
time that the system, in the meantime, likely con¬ 
sumes all remaining copies of r, along with a mech¬ 
anism to ensure that a leader is elected even if the 
second leader is also consumed by an r. The fol¬ 
lowing PP achieves this, with initial configuration i 
given by i{£) = 2, i(r) = i(x) = n — i(r) — i{£), 
with transitions 


r,£^r,£' (3.5) 

£', X ^ £',k (3-6) 

k,x ^ k,k (3.7) 

k,r^k,k (3.8) 

£',£'^£,k (3.9) 


An analysis similar to the previous PP shows that 
the expected time to stabilize to £ = 1 is 0(n^/^ log n) 
Informally, transition (3.5) consumes one copy of £ 
after expected time OlrA/"^). Transition (3.6) subse¬ 
quently produces fc in expected time 0(1), and tran¬ 
sitions (3.7) and (3.8) remove all r and x in expected 
time O(logn). With high probability this happens 
before transition (3.5) can execute a second time, 
but if not, then £' = 2, so transition (3.9) guaran¬ 
tees that a single leader is stably elected (as above, if 
this is needed, it requires expected time Q{n), but it 
is needed with such low probability that the overall 
expected time remains sublinear). 


This is the main theorem of [17], of which Theorem 4.3 
is a corollary. 
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In summary, the two PPs above demonstrate that 
the proof cannot be based solely on reasoning about 
the final transition to alter no matter whether that 
transition increases I from 0 to 1 or decreases it from 
2 to 1. 


4 Technical Tools 


4.1 Bottleneck Transitions Require Linear Time 


This section proves a straightforward observation used 
in the proof of our main theorem. It states that, if 
to get from a configuration x S to some config¬ 
uration in a set y C it is necessary to execute 
a transition ri, r 2 —>■ Pi, Vi in which the counts of ri 
and r 2 are both at most some number 6, then the 
expected time to reach from x to some configuration 
in Y is flinjl?). 

Let 6 G N. We say that transition a : ri,r 2 —>■ 
Pi,Pi is a b-bottleneck for configuration c if c(ri) < b 
and c(r 2 ) < b. We say a transition sequence q such 
that X =^q y has a 6-bottleneck transition if some 
transition in g is a 6-bottleneck for the configuration 
where it is applied. 


Observation 4.1 Let 6 G N, x G and Y C 
such that Pr[x=4>y] = 1. // every transition 
sequence taking x to a configuration y € Y has a 
b-bottleneck transition, then T[x^^F] > 2(bAA\)‘^ • 


Proof The probability that a particular transition 
J'l , i '2 Pi,Pi occurs in any configuration c where 
c(ri) < 6 and c(r 2 ) < 6 is at most There 

are no more than \A\^ different transitions.^® By the 
union bound, the probability that in any configura¬ 
tion c, any 6-bottleneck transition occurs is no more 
Thus we can bound the number 


1b^ 


than |4P —7 

I I n(n —1) 

of interactions until the first 6-bottleneck transition 
occurs by a geometric random variable with success 
probability at most whence the expected 

number of interactions until the first 6-bottleneck is 
at least 2 (\>\a[}^ ■ the parallel time is defined 


If ri A ri and c(ri) = c(r 2 ) = b, then the probability 
to pick the first agent in one of the states ri or r 2 is 
and the probability to pick the second agent in the other 
state is so the total probability of both is 

The case for ri = r 2 gives k for the first times for 

the second, resulting in lower total probability ^ . 

With a nondeterministic transition function, the total 
number of transitions would replace the quantity \A\^ in 
the conclusion, but it would remain a constant indepen¬ 
dent of the size of the initial configuration. 


as the number of interactions divided by n, this cor¬ 
responds to 2 {b-jA\)‘i expected time. By assumption, 
the set of configurations Y is reached only after a 
6 -bottleneck transition occurs. Therefore, the state¬ 
ment of the lemma follows. □ 

Corollary 4.2 Let 7 > 0, 6 G N, c G N^, and 
X,Y C such that Pr[c =7- X] > 7, Pr[c Y\ = 
1, and every transition sequence from every x G X 
to some y G y has a b-bottleneck transition. Then 

T[c=^y]>72l5W- 

4.2 Sublinear Time from Dense Configurations 
Implies Bottleneck Free Path from Configurations 
with Every State “Populous” 

The following theorem, along with Corollary 4.2, fully 
captures the probability theory necessary to prove 
our main theorem.^® Given it and Corollary 4.2, The¬ 
orem 3.8 is provable (through Lemma 4.4) using only 
combinatorial arguments about reachability between 
configurations. 

For ease of notation, we assume throughout this 
paper that all states in A are producible, meaning 
they have positive count in some reachable configu¬ 
ration. Otherwise the following theorem applies only 
to states that are actually producible. Recall that 
for a > 0, a configuration c is a-dense if for all 
s G 4, c(s) > 0 implies that c(s) > a||c||. Say that 
c G is full if (Vs G A) c(s) > 0, i.e., every state 
is present. The following theorem states that with 
high probability, a PP will reach from an a-dense 
configuration to a configuration in which all states 
are present (full) in “large” count (/3-dense, for some 
0 < /3 < a). 

Theorem 4.3 (adapted from [17]) LetV = (4,6) 
be a PP and a > 0. Then there are constants e, /3 > 0 
such that, letting X = { x G j x is full and (3- 
dense }, for all sufficiently large a-dense configura¬ 
tions i, Pr[i =5> X] > 1 — . 

The following lemma reduces the problem of prov¬ 
ing Theorem 3.8 to a combinatorial statement in¬ 
volving only reachability among configurations (and 

Theorem 4.3 was proven in a more general model for 
Chemical Reaction Networks (CRNs) that obey a certain 
technical condition [17]. As observed in that paper, the 
class of CRNs obeying that condition includes all PPs, so 
the theorem holds unconditionally for PPs. The theorem 
proved in [17] is more general than Theorem 4.3, but we 
have stated a corollary of it here. A similar statement is 
implicit in the proof sketch of Lemma 5 of a technical 
report on a variant model called “urn automata” that has 
PPs as a special case [5]. 
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the lack of bottleneck transitions between them). In 
Section 5 we will prove Theorem 3.8 by showing that 
the existence of the configurations x„i and ym and 
the transition sequence Pm in the following lemma 
implies that we can reach a Q-stable configuration 
V G N^, where T = unbdd(F) and Y is the set of 
Q-stable configurations reachable from I. 

Lemma 4.4 Let a > 0. Let V — {A, S) be a PP such 
that, for some set of transitions Q and infinite set of 
a-dense initial configurations I, V Q-stabilizes from 
I in expected time o(n). Then for all m G N, there is 
an no such that for all i G I with ||i|| > no, there is 
a configuration reachable from i and transition 
sequence Pm such that: (1) Xm(s) > m for all s G A, 
(2) Xm Ym, where Ym is Q-stable, and (3) Pm 

has no m-bottleneck transition. 

Proof Intuitively, the lemma follows from the fact 
that states Xm are reached with high probability 
by Theorem 4.3, and if no paths such as Pm ex¬ 
isted, then all paths from Xm to a stable configu¬ 
ration would have a bottleneck and require linear 
time. Since Xm is reached with high probability, this 
would imply the entire expected time is linear. 

For any configuration x,„ reachable from some 
configuration in /, there is a transition sequence Pm 
satisfying condition ( 2 ) by the fact that V Q-stabilizes 
from I. It remains to show we can find Xm and Pm 
satisfying conditions (I) and (3). 

By Theorem 4.3 there exist e, /3 (which depend 
only on P and a) such that, starting in any suffi¬ 
ciently large initial configuration i, with probabil¬ 
ity at least I — 2“'^", P reaches a configuration x 
where all states s G A have count at least j3n, where 
n = ||i||. For all i, let Xi = {x | i=>x and (Vs G 
A) x(s) > /3||i||}. Given any m gN, let no be a lower 
bound on n such that: Theorem 4.3 applies for all 
n > no, 1 — 2“*^”° > i, and further no > m//3. Then, 
for alii G I such that ||i|| = n > no, Pr[i= 4 >Xi] > |. 
Choose any n > no for which there is i G / with 
||i|| = n. Then any x^ G satisfies condition (1): 
Xm,(s) > m for all s G A. We now show that by 
choosing Xm from Xi for a large enough n, we can 
find a corresponding pm satisfying condition (3) as 
well. 

Suppose for the sake of contradiction that for 
some m we cannot satisfy condition (3) when choos¬ 
ing Xm as above, no matter how large we make n. 
This means that, letting Y be set of Q-stable config¬ 
urations, for infinitely many i G /, (and therefore in¬ 
finitely many population sizes n = ||i||), all transition 
sequences from Xi to Y have an m-bottleneck. Then 


Corollary 4.2, letting c = i, 7 = |, X = Xi, and 
b = m, tells us that T[i^>F] > 5277 ^^ ^ I2(n), 
a contradiction since P is supposed to Q-stabilize 
from I in expected time o{n), i.e., T[i=7>y] = o(n). 

□ 

4.3 Transition Ordering Lemma 

The following lemma was first proven (in the more 
general model of Chemical Reaction Networks) in [13] 
We provide a proof for the sake of self-containment. 
Intuitively, the lemma states that a “fast” transition 
sequence (meaning one without a bottleneck transi¬ 
tion) that decreases certain states from large counts 
to small counts must contain transitions of a cer¬ 
tain restricted form. In particular the form is as fol¬ 
lows: if A is the set of states whose counts decrease 
from large to small, then we can write the states in 
A in some order di,d 2 , ■ ■ ■ ,dk, such that for each 
1 < z < fc, there is a transition that consumes 
di, and every other state involved in is either not 
in A, or comes later in the ordering. These transi¬ 
tions will later be used to do controlled “surgery” on 
fast transition sequences, because they give a way to 
alter the count of di, by inserting or removing the 
transitions Oj, knowing that this will not affect the 
counts of di,... ,di-i. 

Lemma 4.5 (Adapted from [13]) Let & N 

such that 62 > \A\ ■ bi- Let x, y G such that 
X y via transition sequence q that does not con¬ 
tain a b 2 -bottleneck. Define 

A = { d G A I x(d) > 62 and y(d) < 61 } . 

Then there is an order on A, so that we may write 
A = {di,d 2 , . •., dk}, such that, for alii G , k}, 

there is a transition ai of the form di,Si —>■ Oi,o^, 
such that Si,Oi,o^ ^ {di,... ,di}, and ai occurs at 
least ( 1)2 — jdj • bi)/\A\^ times in q. 

Proof We define the ordering based on increasing 
sets 0 = Z\o C Ai c Z \2 C ...Ak-i C Ak = A, 
where for each 1 < i < k, Ai = Ai-i U {di}. 

We define the ordering inductively “in reverse,” 
by first defining dk, then dk-i, etc. For all 1 < 
i < k, define T>i : —>■ N for all configurations 

c by 'Pi{c) = c(d). d>k is well-defined since 

Ak = A, and (Li is well-defined once we have defined 
dj+i ,. ..,dk, because Ai = A \ {di+i,... ,dk]. 

Because y((i) < 61 for all d G A, it follows that 
^^i(y) ^ i ■ bi < [A] • 61 . Recall that x(d) > 62 for 
all d G A. Let r be the suffix of q after the last 




12 


David Doty, David Soloveichik 


configuration c' along q such that ^i(c') > 62- Then 
in all configurations c in r, c{d) < &2 for all d G Ai. 
Because d>i{c') > 62 but •d>i{y) < |yl| -61, r contains a 
subsequence u of transitions, each of which strictly 
decreases d>i, and the total decrease in ‘Pi over all of 
u is at least (62 “ |^| ■ &i) between configurations c' 
and y. 

Let a : ri, r2 ^ J5i,P2 be a transition in u. Since 
a strictly decreases Pi, ri G Ai or r2 G Ai; as¬ 
sume without loss of generality that ri G Ai. Fur¬ 
ther, since u does not contain a 62-bottleneck, and 
all configurations c along u have c(d) < 62 for all 
d G Z\i, for a not to be a 62-bottleneck, we must have 
r2 ^ Ai. Since exactly one state in Ai decreases its 
count. Pi ^ Ai and p2 ^ Ai, or else a would not de¬ 
crease Pi. Let di = ri, Si = r2,Oi = pi, and o' = P2- 

Then a decreases Pi by exactly 1 . Since there are 
at least 62 — |vl| • 61 instances of such transitions in u, 
and there are at most \A\'^ total types of transitions, 
by the pigeonhole principle at least one transition 
type must repeat in u at least (62 — |vl| • bi)/\A\^ 
times. □ 

It is instructive to observe how Lemma 4.5 can 
fail if the transition sequence q contains a 62-bottleneck. 
Consider the linear-time leader election PP given by 
the transition £,i — >■ i, f with initial configuration 
X = {n£} and final configuration y = { 1 £, (n — 1 )/}. 

In this case, A = {£}, but once the count of £ drops 
below 62, subsequent transitions are 62-bottlenecks. 
Thus, the hypothesis of the lemma is not obeyed, 
and indeed, there is no transition £, s —>■ o, o' such 
that i ^ {s, o, 0'}. 

5 Proof of Theorem 3.8 

Let /' C / be an infinite subset of / such that 
2 /' C I. Recall Lemma 4 . 4 . We use it (letting I 
in Lemma 4.4 be /') to construct infinite sequences 
(xm) and (ym) of configurations and (pm) of paths 
as follows. Let m gN. Lemma 4.4 tells us that there 
is an no such that for all i G I' with ||i|| > uq, there 
is a configuration x^ reachable from i and transition 
sequence Pm such that: (1) Xm{s) > m for all s S T, 
( 2 ) Xm Ym, where y^ is Q-stable, and ( 3 ) Pm 

has no m-bottleneck transition. 

By Dickson’s Lemma (Lemma 2 . 1 ) there is an in¬ 
finite subsequence of values of m for which both (x^) 
and (ym) are nondecreasing. Without loss of general¬ 
ity, we take (xm), (ym), and (pm) to be these subse¬ 
quences. Let A = bdd((ym)) and F = unbdd((ym)) = 
yl\Z\. Note that since each ym G Y (the set of stable 


configurations reachable from some i G I), we have 
that bdd(F) C bdd((ym)). Thus, we prove the theo¬ 
rem by showing that for infinitely many y G Y, for 
all s G A, y{s) = 0. 

Note that stability is closed downward: subsets of 
a Q-stable configuration are Q-stable. For any fixed 

G < ym for sufficiently large m, by the 

definition of F (the states that grow unboundedly 
in ym as TO —)• 00 ). All ym are Q-stable. Thus any 
configuration G is automatically Q-stable. 
This is why Claims 1, 2, and 3 of this proof center 
around reaching configurations that have count 0 of 
every state in A. 

Overview of Claims 1-3. Recall the path Xm Ym 

from Lemma 4.4. Intuitively, Claim 1 below says that 
because this path is m-bottleneck free. Lemma 4.5 
applies, and its transitions can appended to the path 
to consume all states in Z\ from ym, resulting in a 
configuration Zm that contains only states in F. If 
this is possible directly as stated, this would corre¬ 
spond to the formal claim that Xm =^pm ym =^p’^ ^m, 
where Zm G contains no states in A. However, we 
do not know how to prove this directly (although it 
may be true). Instead, we show in Claim 1 that ym 
can reach to such a. G if some extra agents 
in special states are supplied, i.e., that there exists 
e G (the extra agents) such that ym + e =4> z^. 
By additivity Xm-l-e^>p^ ym + e, so Xm-l-e=>Zm. 

So where will these extra agents come from? Al¬ 
though we talk about them as if they are some¬ 
how physically added, in actuality, we’ll start with a 
larger initial configuration and “guide” some of the 
agents to the desired states that make up e. Claims 2 
and 3 explain how this happens. 

Claim 2, also relying on Lemma 4.5, is a way 
to produce the states corresponding to e needed for 
Claim 1. Claim 2 states, intuitively, that any such 
e G can be produced from Xm, as long as this is 
done “in the context” of extra states (corresponding 
to p G N^). However, unlike the extra states e G 
as used in Claim 1, the states in p are “recovered”. 

To understand why Claim 2 is useful, it is help¬ 
ful to look how the proof of Claim 3 works. The 
initial configuration i in Claim 3 can be split into 
two halves i' where i = 2i'. We have i'^^Xm, so 
i 2xm • The proof of Claim 3 works by employing 
Claim 2 to produce e from one copy of Xm (while 
at the same time turning Xm into G N-^), using 
the other copy of Xm as the “context” p that allows 
Claim 2 to work. Once e is produced and the sec¬ 
ond copy of Xm is recovered. Claim 1 is then used to 
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show that Xm + e => . In other words, having al¬ 

ready eliminated F states from the hrst copy of x^, 
turning it into w^, we use e to eliminate F states 
from the other copy of x^, turning it into . Thus 

i = 2i' => 2xm => Xm-bw^-be z^-bw^ = G 

We argued above that is Q-stable, proving 
Theorem 3.8. 

Claim 1 There is e € sueh that for all large 

enough m, there is z^ G sueh that x^-be =b z^. 

Example. We hrst illustrate Claim 1 through an ex¬ 
ample. Dehne a PP by the transitions 

b,a^f,c (5.1) 

b,c^f,a (5.2) 

a,c^f,f (5.3) 

f,c^f,b (5.4) 

(5.5) 

For convenience, for state s € A, let s also denote the 
count of that state in the conhguration considered. 
Let conhguration x^ be where / = 100, a = 100, 
b = 100, c = 100. Suppose a transition sequence 
Pm without an m-bottleneck (m = 100) takes the 
PP from Xm to ym, in which a = 3, 6 = 2, c=l, 
and / = 394. Then in the language of Lemma 4.5, 
A = {a, 6 , c}; these states go from “large” count in 
Xm to “small” count in y^,. 

Our strategy is to add interactions to pm in or¬ 
der to reach a conhguration z^ with a = b = c = 0. 
There are two issues we must deal with. First, to get 
rid of a we may try to add 3 instances of (5.1) at 
the end of Pm- However, there is only enough 6 for 2 
instances. To eliminate such dependency, in Claim 1, 
whenever we add a transition 6 , a —^ /, c, we add an 
extra agent in state 6 to e. (In general if we consume 
r 2 by adding transition ri,r 2 —>• pi,p 2 , we add an 
extra agent in state ri to e.) Second, we need to pre¬ 
vent circularity in consuming and producing states. 
Imagine trying to add more executions of (5.1) to 
get a to 0 and more of (5.2) to get c to 0; this will 
fail because these transitions conserve the quantity 
a-be. To drive each of these states to 0, we must hnd 
some ordering on them so that each can be driven 
to 0 using a transition that does not affect the count 
of any state previously driven to 0 . 

Lemma 4.5 gives us a way to eliminate such de¬ 
pendency systematically. In the example above, we 
can hnd the ordering di = a, d 2 = c, and d^ = 6 , 
with respective transitions (5.1) to drive a to 0 (3 
executions), (5.4) to drive c to 0 (4 executions: 1 to 


consume the 1 copy of c in y^, and 3 more to con¬ 
sume the extra 3 copies that were produced by the 
3 extra executions of (5.1)), and (5.5) to drive 6 to 
0 (6 executions: 2 to consume 2 copies of 6 in y^, 
and 4 more to consume the extra 4 copies that were 
produced by the 4 extra executions of (5.4)). 

Proof (of Claim 1 ) Intuitively, the proof works as 
follows. Recall that x^ jm and Pm does not 

contain an m-bottleneck. The goal is to get from con¬ 
hguration Ym (which may be positive on some ele¬ 
ments of A) to z^ (which is 0 on all elements of A). 
(Recall that A and F partition the set of states A.) 
We will show that we can append to the end of Pm 
transitions : di, Si —>■ Oi, o', for i G {1,..., fc}—in 
that order—such that for alH, di G and Oi,o' ^ 
{di,..., di}. We use Lemma 4.5 to hnd the neces¬ 
sary transitions. We add enough Ui transitions to 
consume all copies of di. (We’ll use Ci to indicate the 
number of transitions added.) However, this will also 
consume copies of Si, so we add more agents in state 
Si to e to account for this. (We’ll use ei to represent 
the additional copies of Si added, with eventually 
e = ®*') Once we have added enough ai tran¬ 

sitions to make the count of di equal to 0 , by the 
fact that for all j, 0 {di ,... ,dj}, subsequently 

added transitions aj for j > i will not produce di (so 
its count will stay 0 ), nor will it require consuming 
di (so the transitions will be applicable). However, 
prior to reaching the point where we add ai transi¬ 
tions, if di = Oj or o' for j < i, then the excess copies 
of di generated by the extra aj transitions mean that 
we may need to add more than ym(di) copies of ai 
to consume all the copies of di. The resulting conhg¬ 
uration will be z^ G N-^. 

More formally, we choose large enough m such 
that the counts of species in A are no longer chang¬ 
ing with m in (ym); thus, the same e and the same 
appended transitions will suffice for all larger m. Re¬ 
call that x,„ Ym and Pm does not contain an 

m-bottleneck. We’ll apply Lemma 4.5 on this path 
with 62 = m and let 61 be the largest count of any 
species in A anywhere in the sequence (ym)- (R nec¬ 
essary, increase m further to ensure 62 > |bl| • 61 .) 
Note that with these parameters, A = bdd((ym)) 
exactly matches the set of states “Z\” dehned in the 
statement of Lemma 4.5. Then this lemma tells us 
that there is an ordering on A, so that we can write 
A = {di,...,dfc}, such that for each 1 < i < k, 
there is a transition ai : di,Si —>• Oi, o' such that 
di € A and Si,Oi,o'i ^ {di,...,di}. (The fact that 
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Si ^ {(ii ,... ,di} is not used in this Claim, but it 
will be essential for Claim 2). 

For all i e {1,..., k}, let 

- Ci= ym{di) + Y^''jl\{cjOj,CjOj} (note that if = 
Oj, then {cjOj,Cjo'j} = { 2 cjOj}) 

^ — {^iSi\ 

Given a transition a and j G N, let j ■ a denote 
the transition sequence consisting of j copies of a. 
For alH e {0,..., k}, define Zm,i as follows, where 

2m,0 ~ Ym" 

2m,0 yCl -ai 2m-,1 

2m,1 d“ ^2 yC 2 -a 2 2m,2 

2m,fc—1 “f ''Ck -otk ^m,k- 

For all i G {0,..., fc}, define pathpm,i inductively 
to be Pm,i-i followed by Ci ■ ai, where the base case 
is Pm,0 ~ Pmi SO that Xm “t“ —^^0 

prove by induction on i that: 

1. Pm,i is a valid transition sequence (i.e., it never 
has a transition in a configuration in which the 
input states are not present), 

2. for all j G {1,... ,i}, 2m,i(dj) = 0, and 

3. for all j G {i + 1,..., k}, Zm,%{dj) = ym(dj) + 
(dj) (in particular, Zm,i(di+i) = 

Ci+i, which is the amount that we remove in path 

Cz+l ■ ). 

The base case z = 0 for 1 follows from the fact 
that Pm is a valid transition sequence to apply to 
Xm- The base case is vacuous for 2. For 3, observe 
that Zm,o = Ym and the sum is empty when z = 0. 

Assume the inductive case for z — 1. We have 
2m,Z = 2m,i-i + 6^ + Ci{oi,o'} - Ci{di,Si} where 
Ci{oi,o'} — Ci{di,Si} is the effect of applying tran¬ 
sition ai for a times. Since = {ciSi\, we have 
2m,Z = 2m,i-i + Ci{oi,o'} - Ci{di}. By the induc¬ 
tion hypothesis 3 for z — 1, Zm,z-i(di) = c^. Since 
di ^ {oi,o'}, this implies that CiOz is a valid path 
(which establishes the inductive case for 1). Further, 
for all j G {1,..., z — 1}, dj 0 {o^, o', di}, which im¬ 
plies that the amount of dj is not changed by ai, and 
by inductive hypothesis 2, for all j G {1,..., z — 1}, 
'^m,i{dj) = 0. Since also Zm,z(dz) = 0, we establish 
the inductive case for 2. 

Inductive case 3 is proven as follows. Let j G {z-|- 
1,..., k}. Induction hypothesis 3 gives that Zm,z-i(dj) 

yrn{dj)+(^Yl,l=\{^i°i^ (dj).Letb = {ciOj,Cio'} 

be the new term in the sum for the inductive case z; 


we must show that Zm,i = 2m,z-i +b. Then h{dj) = 

0 if dj is not an output state of ai, h{dj) = Ci if dj 
is exactly one output state, and h{dj) = 2ci if dj 
is both output states. Thus after applying a ■ ai to 
2m,z-i + Gi to result in configuration Zm,i, we have 
increased the count of dj by exactly h{dj), result¬ 
ing in Zm.i(dj) = Ym(dj) + cio}}^ (dj), 

proving the inductive case for 3. 

To complete the proof we let z(0 = Zm,fc, for the 
final value k. Inductive case 2 on this final value k, 
shows that for all j G {1,..., fc}, Zm,k[dj) = z^{dj) = 

0, thus proving that z^ G N'^. Finally, note that 
e = statement of the claim. □ 

Intuitively, Claim 2 below works toward generat¬ 
ing the vector of states e that we needed for Claim 1. 
The “cost” for Claim 2 is that the path must be 
taken “in the context” of additional agents in states 
captured by p. Importantly, the net effect of the path 
preserves p, which will give us a way to “interleave” 
Claims 1 and 2 as shown in Claim 3. 

Claim 2 For all e G there is p € such that 

for all large enough m, there is G such that 

p -1- Xm p + e. 

Example. Recall the example above illustrating Claim 1 
Claim 2 is more difficult than Claim 1 for two rea¬ 
sons. First, we need to be able to obtain any counts 
of states a, b, c, f in e, and not only ensure that 
a = b = c = 0. Second, we no longer have the free¬ 
dom to consume extra states (i.e., e in Claim 1). 
Note that p cannot fulfill the same role as e did in 
Claim 1 because p must be recovered at the end. 

For concreteness, suppose e consists of a = 7, 

6 = 2, c = 0, / = 3. To start with, note that han¬ 
dling state / in e is easy: recall / is in T = A \ Z\ 
and is present in “large” count in ym- We can sim¬ 
ply “siphon” the required number of agents in state 
/ into e leaving the rest as 'wff. For the rest of e, 
recall that ym has a = 3, 5 = 2, c=l. How can we 
generate an additional 4 copies of a? Note that all 
transitions preserve or decrease the sum a -I- 5 -|- c. 
Thus we cannot solely add interactions to Pm to get 
to our desired e. The key is that we can increase 
a by removing existing interactions from pm that 
consumed it. Indeed, Lemma 4.5 helps us by giving 
a lower bound on the number of instances of tran¬ 
sitions (5.1),(5.4),(5.5) that must have occurred in 
Pm- (Note that in Claim 1, we didn’t need to use the 
fact that these transitions occurred in pm- Now, we 
need to ensure that there are enough instances for 
us to remove.) 
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In our case, to increase a by 4, we can remove 4 
instances of interaction (5.1) from pm, resulting in 
a = 7,6 = 6, c = —3.^^ To get c = 0 as desired, we 
can remove 3 instances of transition (5.4), resulting 
ina = 7,6 = 3,c = 0. Finally, we add 1 instance of 
transition (5.5) to get a = 7, 6 = 2, c = 0 as desired. 

Note that unlike in Claim 1, we have more po¬ 
tential for circularity now because we cannot add 
the other input to a transition as e. For example, we 
can’t use transition (5.3) to affect c because it affects 
a (which we have previously driven to the desired 
count). Luckily, the ordering given by Lemma 4.5 
avoids any circularity because the other input and 
both of the outputs come later in the ordering. 

Importantly, as we remove or add interactions 
to Pm, we could potentially drive the count of some 
state negative—but only temporarily because the fi¬ 
nal counts (w^ -b e) are nonnegative. Performing 
these interactions in the context of more agents (p) 
ensures that the path is valid. 

Proof (of Claim 2) Define G by e‘^(d) = e(d) 
for all d G A and e"^(s) = 0 for all s G P, and define 

G similarly, so that e = -be^. (Recall that 

A and P partition the set of states A.) We proceed 
by proving the following claim, which focuses only 
on the part of e, and which additionally ensures 
that grows on all states in F as m ^ oo: 

(*) For all G there is p G N^, such that 
for all large enough m, there is G N^, such that 
p -b Xm =b p -b -b and unbdd((w^)) = F. 

Supposing this claim is true, it is easy to com¬ 
plete the proof of Claim 2 by handling positive 
as follows. Since unbdd((w^)) = F, given any e^, 
is non-negative for large enough m. Then 
p-bx„= 4 >p + w^-be"^ = p-b (w^ - e^) -b e. In 
other words we apply claim (*) to produce e^, and 
then “siphon” the remaining states from to 
produce e = -b e^, which maintains the required 
conclusions of Claim 2 with (w^—e^) replacing w^. 

We now show how to prove the above claim (*). 
Recall that x^ =^pm Ym and Pm does not contain 
an m-bottleneck. Intuitively, we will try to modify 
Pm so that in the end we get exactly of Zi. As in 
the proof of Claim 1, we will use the fact that pm 
does not contain an m-bottleneck and Lemma 4.5 to 

Note the need for p to ensure that the total count 
never goes negative. In writing “a = 7, fe = 6,c = —3”, 
we are examining the effect only on ym of modifying pm, 
but in applying the lemma, the starting configuration is 
Xm -b p, not merely x^, so the actual count of each state 
s will be p(s) larger than just stated. 


find transitions affecting A in a non-circular man¬ 
ner. However, unlike in Claim 1, we cannot simply 
consume additional states (i.e., e G in Claim 1) 
to ensure that the count of the “other input state 
Si' does not become negative. Rather, to increase 
the amounts of Si we will remove certain transition 
instances originally in pm- It turns out that even 
with removing transitions, our modification to Pm 
may still temporarily take certain states negative if 
we start from x^. However, executing the path in 
the context of p provides “buffer room” to ensure 
that no counts ever go below zero. 

More formally, as in the proof of Claim 1 apply 
Lemma 4.5 with b 2 = m and let bi be the largest 
count of any state in A anywhere in the sequence 
(ym) • The lower bound on 62 = m is determined be¬ 
low (“bound on the amount of fixing”). Lemma 4.5 
tells us that there is an ordering on A, so that we can 
write A = {di ,..., dk}, such that for each 1 < i < k, 
there is a transition : di,Si —>■ o^, o' such that di G 
A and Si, Oi,o\ ^ {di ,..., di}, and ai occurs at least 
{b 2 — \A\-bi)/\A\^ times in pm■ Note that the final con¬ 
dition was not necessary to prove Claim 1 since its 
proof only added transitions to Pm- However, since 
the current proof removes transitions as well, we re¬ 
quire this condition to ensure that there are suffi¬ 
ciently many existing instances to be removed. 

We iteratively fix the counts of states in A one 
by one, in the ordering given, i.e. we first adjust 
Pm to fix di, then we fix d 2 (while showing that 
the fixing of d 2 cannot affect the count of di in 
any configuration, so it remains fixed), etc. We start 
with e^(s) = ym(s) for s G A and e^(s) = 0 
for s G F, and w^q(s) = ym{s) for s G F and 
= 0 for s G A. Having fixed di,... ,di-i, 
and obtaining new (which could now 

be negative) such that agrees with the desired 
over di,... ,di-i, we process di as follows. If 
6 i = e‘^{di) — ef_i{di) < 0: add 5i instances of tran¬ 
sition ai at the end of the transition sequence. If 
di > 0: remove 5i instances of ai where they occur 
in the transition sequence; property (3) ensures that 
q contains enough instances of ai (see below). Let ef 
be the counts of the states in A at the end of this 
path. By property (2) and (3), adding or removing 
instances of ai affects only the counts of states in F 
and di+i ,... ,dk- Since we fix these counts in the pre¬ 
scribed order, when we are done, the counts of each 
di is equal to its count in (ie = e^), while 
counts of elements of F have been altered (letting 
^). We now claim that for large enough 
m, f. is nonnegative, and that p can be indepen- 
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dent of TO. Finally, we derive a bound on the number 
of transition instances that we may need to remove, 
which determines another bound on to (ie 62 ) to en¬ 
sure that there are enough instances by property (4) 
above. 

Note that the amount of fixing we need to do 
only depends on the desired as well as on the 
counts of A states in y^. Because are nonde¬ 
creasing, and A = bdd((ym)), for large enough to, 
the counts of A states in y^ stop changing, and 
the amount of fixing depends only on the desired 
e^. This implies that the p we need to add to en¬ 
sure that no counts go negative can be indepen¬ 
dent of TO. Further, for large enough to, the differ¬ 
ence between and y™ is independent of to, and 
thus unbdd(w^) = unbdd(ym) = T, as needed for 
claim (*). This also implies that for large enough to, 

fc is nonnegative. 

Bound on the amount of fixing: We now derive a 
bound the number of transition instances that must 
be added/removed, in order to justify that this bound 
depends only on e, but is independent of to. De¬ 
fine the quantity ct = max |ym(d) — e'^((i)| as 

the maximum amount that any state in A deviates 
from its desired count. We add or remove at most 
1 ^ 11 < Cb instances of ai, which affects the count of 
states in F U {d 2 ,... ,dk} by at most 2 cb (it could 
be 2 per transition if the transition is ai : cii,s —>■ 
s', s' for some state s' € A). Thus, |i 52 | < Ch + 2|(5i| 
(the original Cf, error plus the additional error from 
altering the number of a\ transitions). In general, 
l^il < Cb + 2(|(5i| -I- ■ • • -I- |^i-i|) < Thus if 

we let TO = &2 > fc ■ we will have 

enough transition instances by property (4) to re¬ 
move ((62 - |^| • &i)/|4lp = 3*“^Cf,). □ 

Claim 3 For infinitely many i € I, there is G 

such that i . 

Proof Intuitively, Claim 3 follows by expressing i = 
2i' where i' G F and i'=>Xm, so 2i'=4>2xm. We 
then apply Claim 2 to one copy of x,„ (with the 
other Xm playing the role of p) to get to a configu¬ 
ration with the correct e for Claim 1, and then apply 
Claim 1 to remove all states in A. 

Choose TO large enough to satisfy the conditions 
stated below as they are needed. By Claim 1, there 
is e S and G N'^ such that x^ -|- e z^. 

Apply Claim 2 on e (making sure to is large enough 
to satisfy the claim on e). Thus, there is p S and 

G such that p-fx™ =>p^ p-f -be. If to is 
large enough that x„ > p, then x^-l-x^ =^Pm + 


-l-e. Then, by Claim 1, x^ + -be -b 

z^. To complete the claim, we let = w^ + z^. □ 

Finally, Theorem 3.8 is proven because is te¬ 
stable and it contains zero count of states in Z\. To 
see that is Q"Stable recall that < y^' for 
sufficiently large to' since F = unbdd((ym)) and 
contains only states in F. Since stability is closed 
downward, and y^' is Q-stable, we have that is 
Q-stable as well. 
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